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ABSTRACT We have previously described a set of human 
T-lymphocyte-specific cDNA clones isolated by a modified 
differential screening procedure. Apparent full-length cDNAs 
containing the sequences of 14 of the 16 initial isolates were 
sequenced and were found to represent five different species of 
mRNA; three of the five species were identical to previously 
reported cDNA sequences of preproenkephalin, T-cell-replac- 
ing factor, and a serine esterase, respectively. The other two 
species, 4-1BB and L2G25B, were inducible sequences found in 
mRNA from both a cytolytic T-lymphocyte and a helper 
T-lymphocyte clone and were not previously described in T-cell 
mRNA; these mRNA sequences encode peptides of 256 and 92 
amino acids, respectively. Both peptides contain putative 
leader sequences. The protein encoded by 4-1BB also has a 
potential membrane anchor segment and other features also 
seen in known receptor proteins. 

Most T-cell factors have been classically identified by recog- 
nizing biologic activities in assays, purifying the protein, and 
then isolating the corresponding genes using bioactivities and 
protein information. An alternative approach is to isolate 
putative T-cell genes based upon specific expression and then 
to demonstrate the function of the unknown molecule (1-3). 
This laboratory has recently cloned a series of T-cell subset- 
specific cDNAs from cloned helper T-lymphocyte (HTL) L2 
cells and cloned cytolytic T-lymphocyte (CTL) L3 cells by 
employing a modified differential screening procedure (4). 
Subsequently, we isolated and sequenced apparent full-length 
cDN As corresponding to 14 of the original clones representing 
five different species of mRNA. Here we summarize our 
studies and present the sequences of two cDNAs that do not 
correspond to previously reported T-cell mRNA species.§ 

MATERIALS AND METHODS 

Cells. We are grateful to F. Fitch of the University of 
Chicago for supplying us with the T-cell clones, L2 and L3 
cells (5). Cloned murine CTL L3 cells are Thy-1,2 + , Lyt-2 + , 
LFA-1 + , L3T4", and H-2L d -reactive. Cloned murine HTL 
L2 cells are Thy-1,2 + , LFA-1\ Lyt-2", L3T4 + , and Mls a/d - 
reactive. 

Before the isolation of RN A L2 cells were stimulated with 
Con A (10 ftg/ml) for 14 hr at a cell concentration of lOMO 7 
cells per ml. L3 cells were stimulated with Con A (2 /-tg/ml) 
for 14 hr at a cell concentration of 2.5 x 10 6 cells per ml. 
Mouse thymoma EL-4 cells (6) were stimulated with phorbol 
12-myristate 13-acetate (10 ng/ml) at a cell concentration of 
1.0 x 10 6 cells per ml for 20 hr; stimulation was monitored by 
interleukin-2 assay (7). RNA was isolated from the unstim- 
ulated B-cell lymphoma K46 (8) and rat natural killer cell 
large granular lymphocytes (9). 
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Isolation of T-Cell-Specific cDNA Clones. We have previ- 
ously isolated a group of cDN As that are specific for T cells 
in contrast with B cells, employing both positive and negative 
differential screening and RNA blot analysis of various 
lymphoid cells. The T-cell-specific cDNAs were further 
studied to determine whether they were specific for cloned 
HTL L2 cells or cloned CTL L3 cells. This allowed us to 
isolate T-cell-subset transcripts that exist at a low level (4). 
The two cDNA sequences, designated as L2G25B and 
4-1BB, were isolated in this fashion and were referred to as 
L2G25#4 and L3G29#4, respectively, in an earlier report (4). 

RNA and DNA Blot Hybridization. Total cytoplasmic RNA 
or poly(A) + RNA was fractionated on 1.2% agarose-form- 
aldehyde gels and transferred to GeneScreenP/ws (NEN). 
Gel-purified cDNA inserts were 32 P-labeled by nick- 
translation and used as probes. When a Northern (RNA) blot 
of GeneScreenP/ws was used multiple times for hybridiza- 
tion, the previous probe was removed by placing the mem- 
brane in 10 mM Tris-HCl (pH 7.0)/0.2% sodium dodecyl 
sulfate (SDS) at 85°C for 1 hr. 

High-molecular weight DNA of mouse spleens was pre- 
pared as described (10). Restriction endonuclease digests of 
DNA were electrophoresed in 0.8% agarose gel at 4°C. The 
DNA was denatured and transferred to GeneScreenP/ws as 
described by Southern (11). The blot was hybridized with 
32 P-labeled cDNA. 

Screening of cDNA Library and DNA Sequencing. L2 and 
L3 cDNA libraries that had previously been prepared (4) 
were rescreened with cDNA inserts of each of the 14 
T-cell-specific cDN A fragments. Typically 10 positive clones 
were chosen for each species, and the sizes of cDNA inserts 
were determined. The longest cDNA inserts were employed 
for nucleotide sequencing by the dideoxynucleotide chain- 
termination technique (12) employing Sequenase (United 
States Biochemical), with modifications made to accommo- 
date 2'-deoxyadenosine 5'[a- 35 S]thiotriphosphate (13). 

Nucleotide and Protein Sequence Comparison. Full-length 
cDNA and predicted protein sequences were compared with 
the sequences in the National Institutes of Health GenBank 
Genetic Sequence Databank (Release 56), European Molec- 
ular Biology Laboratories (Release 16.0), and National Bio- 
medical Research Foundation (Release 17.0). Predicted pro- 
teins were analyzed by the pepplot program. 

RESULTS 

Table 1 summarizes T-cell cDNA identified from 14-hr Con 
A-stimulated L2 and L3 cDNA libraries. Besides the cDNAs 
listed in the table, granulocyte/macrophage colony stimulat- 
ing factor, interleukin 2, interleukin 3, T-cell antigen receptor 
a- and /3-chain, and c-myc cDNAs were identified by cross- 
Abbreviations: CTL, cytolytic T lymphocyte; HTL, helper T lympho- 
cyte. 

'The sequences reported in this paper are being deposited in the 
EMBL/GenBank data base (accession nos. J04491 and J ^ 9 2J*q^2 
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Table 1. Summary of cDNA clones identified 



C..11 

rull- 


cUWA clone 






length 


isolated 


opeciiicuy 




CUNA 


previously* 


oi expression 


iQenuiicaiion 


4-1BB 


L3G29#4 


L2 and L3 


Unknown 




L3G25#4 


L2 and L3 






L3G14#2 






L2G25B 


L2G25#4 


L2 and L3 


Unknown (related 




L2G95#4 




to cDNA PLD78) 




L2G53#3 








L2G95#3 






L2S35 


L2S35#3 


L2 only 


Preproenkephalin 




L2PBK671 






8-1R 


L2PBK791 


L2 and EL-4 


T-cell-replacing 




L2PBK642 




factor 




L2PBK631 






L3G10 


L3G10#6 


L3 only 


HF gene (serine 




L3G18#3 




esterase) 


NDt 


L3G7#1 


L3 and EL-4 


Unknown 


NDt 


L3G26#1 


L3 and EL-4 


Unknown 



*cDNA clones were isolated independently and described as sepa- 
rate clones in the previous report (4). 
tFull-length version of the two clones was not isolated. 



hybridization of T-cell-enriched cDNAs with the correspond- 
ing full-length cDNA provided by other laboratories. 

Among the 16 unidentified T-cell cDN A clones isolated by 
this screen, two clones represented preproenkephalin that 
was identical to the sequence reported by Zurawski et ai 
(14), three clones were T-cell-replacing factor (15), and two 
clones represented a T-cell serine esterase gene (16). 

In the previous studies (4) we had also isolated an addi- 
tional 13 L3 cDNAs whose specificity could not be assigned 
by RNA blot analysis because of the weak signals. One of 
them (L3-1) was 64% homologous to a T-cell serine esterase 
(17). The sequence has been reported as a new member of the 
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T-cell serine esterase family (18). The other 12 clones have 
not yet been analyzed. 

Four species were from different regions of a cDNA 
referred to as L2G25B (800 base pairs). L2G25B was homol- 
ogous to a human cDNA PLD78 (19) of unknown function. 
Three species (L3G29#4, L3G25#4, and L3G14#2) were 
from different regions of 4-1BB (2400 base pairs). We have 
found ho reports of sequences homologous to 4-1BB. 
L3G7#1 and L3G26#1 were not characterized vigorously 
because they were represented at low levels in T-cell mRNA 
and were not inducible. 

Fig. 1A shows the nucleotide and deduced amino acid 
sequence of the longest open reading frame of L2G25B. The 
open reading frame codes for 92 amino acids including a 
putative signal sequence. The deduced sequence of the first 
23 amino acid residues has characteristics of the signal 
peptide of secretory proteins (23, 39). Therefore the mature 
protein is composed of 69 amino acids with a M T of 7880. No 
potential N-glycosylation sites or transmembrane-like do- 
main exists. The 3 '-untranslated region has repeated occur- 
rences of the sequence AUUUA (40) characteristic of lym- 
phokines and other rapidly degraded mRNA. 

An optimum alignment between L2G25B and PLD78 is 
shown in Fig. 15. The identity of amino acids between the 
two proteins was ~80%. The evolutionary conservation of 
these molecules from mouse to human may indicate that they 
play an important role in T-cell function. Recently a family of 
lymphokine genes termed RANTES (regulated upon activa- 
tion, normal T expressed, and presumably secreted) has been 
defined to include PLD78 (20). After the present manuscript 
was prepared, a report appeared describing a macrophage 
inflammatory protein (21); the predicted amino acid sequence 
of this monokine is identical to that of L2G25B. 

The nucleotide sequence of three overlapping cDNA 
clones represented by 4-1BB was determined (Fig. 2). The 
nucleotide sequence of 4-1BB revealed a single long open 
reading frame, beginning with the ATG codon at nucleotide 



-47 TTTTCTG TTCTGCTGAC AAGCTCACCC TCTGTCACCT GCTCAACATC 

1 ATG AAG GTC TCC ACC ACT GCC CTT GCT GTT CTT CTC TGT ACC ATG ACA CTC TGC AAC CAA 

1 Met Lys Val Ser Thr Thr Ala Leu Ala Val Leu Leu Cys Thr Met Thr Leu Cys Asn Gin 

61 GTC TTC TCA GCG CCA TAT GGA GCT GAC ACC CCG ACT GCC TGC TGC TTC TCC TAC AGC CGG 

21 Val Phe Ser Ala Pro Tyr Gly Ala Asp Thr Pro Thr Ala Cys Cys Phe Ser Tyr Ser Arg 

121 AAG ATT CCA CGC CAA TTC ATC GTT GAC TAT TTT GAA ACC AGC AGC CTT TGC TCC CAG CCA 

41 Lys lie Pro Arg Gin Phe lie Val Asp Tyr Phe Glu Thr Ser Ser Leu Cys Ser Gin Pro 

181 GGT GTC ATT TTC CTG ACT AAG AGA AAC CGG CAG ATC TGC GCT GAC TCC AAA GAG ACC TGG 

61 Gly Val lie Phe Leu Thr Lys Arg Asn Arg Gin He Cys Ala Asp Ser Lys Glu Thr Trp 

241 GTC CAA GAA TAC ATC ACT GAC CTG GAA CTG AAT GCC TGA GAG TCT TGG AGG CAG CGA GGA 

81 Val Gin Glu Tyr He Thr Asp Leu Glu Leu Asn Ala 

301 ACC CCC CAA ACC TCC ATG GGT CCC GTG TAG AGC AGG GGC TTG AGC CCC GGA ACA TTC CTG 

361 CCA CCT GCA TAG CTC CAT CTC CTA TAA GCT GTT TGC TGC CAA GTA GCC ACA TCG AGG GAC 

421 TCT TCA CTT GAA ATT TTA TTT AAT TTA ATC CTA TTG GTT TAA TAC TAT TTA ATT TTG TAA 

481 TTT ATT TTA TTG TCA TAC TTG TAT TTG TGA CTA TTT ATT CTG AAA GAC TTC AGG ACA CGT 

541 TCC TCA ACC CCC ATC TCC CTC CCA GTT GGT CAC ACT GTT TGG TGA CAG CTA TTC TAG GTA 

601 GAC ATG ATG AC A AAG TCA TGA ACT GAC AAA TGT ACA ATA GAT GCT TTG TTT ATA CCA GAG 

661 AAG T ftA TAA A> TA TGC CCT TTA ACA ACT GAA AAA AAA 



60 
20 



120 
40 



180 
60 



240 
80 



300 



360 
420 
480 
540 
600 
660 



L2G25B (1) Met Lys Val Ser Thr Thr 

* * * * 
PLD78 (1) Met Gin Val Ser Thr Ala 

Val Phe Ser Ala Pro Tyr 
* * * 

- Phe Ser Ala Ser Leu 
A 

Arg Lys lie Pro Arg Gin 

* * * * 
Arg Gin He Pro - Gin 

Gin Pro Gly Val He Phe 

***** 
Lys Pro Gly Val He Phe 

Glu Thr Trp Val Gin Glu 

* * * * 

Glu Glu Trp Val Gin Lys 



Ala Leu Ala Val Leu 

***** 
Ala Leu Ala Val Leu 

Gly Ala Asp Thr Pro 



Leu Cys 

* * 
Leu Cys 

Thr Ala 



Ala Ala Asp Thr Pro Thr Ala 



- Phe He 
* * 
Asn Phe He 

Leu Thr Lys 

* * * 
Leu Thr Lys 

Tyr He Thr 

* + ♦ 
Tyr Val Ser 



Val Asp Tyr Phe 

+ * * * 
Ala Asp 

Arg Asn 
* 

Arg Ser 

Asp Leu Glu Leu 

* * * * 
Asp Leu Glu Leu 



Tyr Phe 

Arg Gin 

* * 
Arg Gin 



Thr Met 

* * 
Thr Met 

Cys Cys 

* * 

Cys Cys 

Glu Thr 

* * 
Glu Thr 

He Cys 
+ * 
Val Cys 

Asn Ala 
* 

Ser Ala 



Thr Leu 
* 

Ala Leu 

Phe Ser 

* * 
Phe Ser 

Ser Ser 

* * 
Ser Ser 

Ala Asp 

* * 
Ala Asp 

(stop) 

(stop) 



Cys Asn Gin 

* * * 
Cys Asn Gin 

Tyr - Ser 

* * 
Tyr Thr Ser 

Leu Cys Ser 

* * 
Gin Cys Ser 

- Ser Lys 
* 

Pro Ser - 

(92) 
(92) 



Fig. 1. (A) Nucleotide sequence of 
L2G25B and the deduced amino acid se- 
quence. Nucleotide sequence of the message 
strand is numbered in the 5' and 3' direction. 
The 5' noncoding sequence is indicated by 
negative numbers. Nucleotide residue 1 is 
the first nucleotide of the ATG initiation 
codon. The predicted amino acid sequence is 
shown below. Potential signal peptide is 
underlined, consensus polyadenylylation 
signal is boxed, and stop codon is indicated 
by — . (B) Optimum alignment between 
L2G25B and PLD78-deduced amino acids. 
Homology search revealed that the amino 
acid sequence of L2G25B showed an exten- 
sive homology with a reported human se- 
quence, PLD78 (15) of unknown function, 
whose expression is inducible by phorbol 
12-myristate 13-acetate in human tonsillar 
lymphocytes. The identity of amino acids 
between the two proteins was «80%. *, 
Identical amino acids in these proteins; +, 
chemically similar amino acids found in both 
sequences; A, a potential cleavage site of the 
signa. peptide. RWON000043 
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-145 

-140 CATGAACTGC TGAGTGGATA AACAGCACGG GATATCTCTG TCTAAAGGAA TATTACTACA CCAGGAAAAG 

-70 GACACATTCG ACAACAGGAA AGGAGCCTGT CACAGAAAAC CACAGTGTCC TGTGCATGTG ACATTTCGCC 

1 ATG GGA AAC AAC TGT TAC AAC GTG GTG GTC ATT GTG CTG CTG CTA GTG GGC TGT GAG AAG 60 

1 Met Gly Asn Asn Cys Tyr Asn Val Val Val He Val Leu Leu Leu Val Gly Cys Glu Lys 20 



61 
21 



GTG GGA GCC GTG CAG AAC TCC TGT GAT AAC TGT CAG CCT GGT ACT TTC TGC AGA AAA TAC 120 
Val Gly Ala Val Gin Asn Ser Cys Asp Asn Cys Gin Pro Gly Thr Phe Cys Arg Lys Tyr 40 



121 
41 



AAT CCA GTC TGC AAG \GC TGC CCT CCA ACT ACC TTC TCC AGC ATA GGT GGA CAG CCG AAC 180 
Asn Pro Val Cys Lys &er Cys Pro Pro Ser Thr Phe Ser Ser He Gly Gly Gin Pro Asn 60 



181 
61 



TGT AAC ATC TGC AGA GTG TGT GCA GGC TAT TTC AGG TTC AAG AAG TTT TGC TCC TCT ACC 240 
Cys Asn He Cys Arg Val Cys Ala Gly Tyr Phe Arg Phe Lys Lys Phe Cys Ser Ser Thr 80 



241 
81 



CAC AAC GCG GAG TGT GAG TGC ATT GAA GGA TTC CAT TGC TTG GGG CCA CAG TGC ACC AGA 300 
His Asn Ala Glu Cys Glu Cys He Glu Gly Phe His Cys Leu Gly Pro Gin Cys Thr Arg 100 



301 
101 



TGT GAA AAG GAC TGC AGG CCT GGC CAG GAG CTA ACG AAG CAG GGT TGC AAA ACC TGT AGC 360 
Cys Glu Lys Asp Cys Arg Pro Gly Gin Glu Leu Thr Lys Gin Gly Cys Lys Thr Cys Ser 120 



361 
121 



TTG GGA ACA TTT AAT GAC CAG AAC GGT ACT GGC GTC TGT CGA CCC TGG ACG AAC TGC TCT 420 
Leu Gly Thr Phe Asn Asp Gin Asn Gly Thr Gly Val Cys Arg Pro Trp Thr Asn Cys Ser 140 



421 
141 



CTA GAC GGA AGG TCT GTG CTT AAG ACC GGG ACC ACG GAG AAG GAC GTG GTG TGT GGA CCC 480 
Leu Asp Gly Arg Ser Val Leu Lys Thr Gly Thr Thr Glu Lys Asp Val Val Cys Gly Pro 160 



481 
161 



CCT GTG GTG AGC TTC TCT CCC AGT ACC ACC ATT TCT GTG ACT CCA GAG GGA GGA CCA GGA 540 
Pro Val Val Ser Phe Ser Pro Ser Thr Thr He Ser Val Thr Pro Glu Gly Gly Pro Gly 180 



541 GGG CAC TCC TTG CAG GTC CTT ACC TTG TTC CTG GCG CTG ACA TCG GCT TTG CTG CTG GCC 600 
181 Gly His Ser Leu Gin Val Leu Thr Leu Phe Leu Ala Leu Thr Ser Ala Leu Leu Leu Ala 200 



601 CTG ATC TTC ATT ACT CTC CTG TTC TCT GTG CTC AAA TGG ATC AGG AAA AAA TTC CCC CAC 660 
201 Leu He Phe He Thr Leu Leu Phe Ser Val Leu Lys Trp He Arg Lys Lys Phe Pro His 220 



661 ATA TTC AAG CAA CCA TTT AAG AAG ACC ACT GGA GCA GCT CAA GAG GAA GAT GCT TGT AGC 720 

221 He Phe Lys Gin Pro Phe Lys Lys Thr Thr Gly Ala Ala Gin Glu Glu Asp Ala Cys Ser 240 

721 TGC CGA TGT CCA CAG GAA GAA GAA GGA GGA GGA GGA GGC TAT GAG CTG TGA TGTACTATC 780 

241 Cys Arg Cys Pro Gin Glu Glu Glu Gly Gly Gly Gly Gly Tyr Glu Leu 



781 


CTAGGAGATG 


TGTGGGCCGA 


851 


CCACCACCCT 


GTTCTTACAC 


921 


GCTAACATAT 


TTGTCTTTAC 


991 


CCTGCCTGTA 


TGCACACGTG 


1061 


AAGGGTTGGT 


TCCATAAGAA 


1131 


TGTCTTCTTA 


TTTTAACGTG 


1201 


GACACCCTTC 


TAGTTAATGA 


1271 


TATATGTATA 


TATAAGACTC 


1341 


TGGACATTTT 


ACGTCACACA 


1411 


TCTACGTCAT 


ATAATGGGAT 


1481 


CTACCCCTTC 


TGGGTACGTA 


1551 


AGTTCCCGGA 


CGAAGAGGAC 


1621 


TTCGTGACAC 


TCCACCCCTT 


1691 


TGTAGGGGCG 


GGGAGACAGA 


1761 


AATGAAACTT 


TTTTAATCTC 


1831 


TATCCTGGCG 


CCAAGATAAA 


1901 


AGCATAACGC 


GGCGATCTCC 


1971 


CTTACAAAAG 


TAATTAGTTC 


2041 


TATTTGCTAC 


GGCTGACCGC 


2111 


TCAGAAACCC 


AAGGCCCCCC 


2181 


CTTAGCTCTT 


TCTCGATAGT 



AACCGAGAAG CACTAGGACC 
ATCATCCTAG ATGATGTGTG 
CTTTTTTAAA TCTTTTTTTA 

tgtgtgtgtg tgtgtgtgac 
ctggag ttat ggat ggctgt 
actgtatEQ53*aaaaaa 

TCTAAGAGGA ATTGTTGATA 
TTTTACTGTC aaagtcaacc 
CACACACACA CACACACACA 
AGGGTAAAAG GAAACCAAAG 
GGGACAGACC TCCTTCGGAC 
AGAGGAGACA CAGTCCGAAA 
GTGGACACTT GAGTGTCATC 
GCCGCGGGGG AGCTACGAGA 
ACAAGTTTCG TCCGGGCTCG 
ACAACCAAAA GCCTTGACTC 
ACTTTAAGAA CCTGGCCGCG 
TTGCTTTCAG CCTCCAAGCT 
TACGCCGCCG CAATAAGGGT 
TCATACCAAC GTTTCGACTT 
TAG AC 



CCACCATCCT GTGGAACAGC ACAAGCAACC 850 

GGCGCGCACC TCATCCAAGT CTCTTCTAAC 920 

AATTTAAATT TTATGTGTGT GAGTGTTTTG 990 

ACTCCTGATG CCTGAGGAGG TCAGAAGAGA 1060 

GAGCCGGnnn GATAGGTCGG GACGGAGACC 1130 

TGATATTTCG GGAATTGTAG AGATTGTCCT 1200 

CGTAGTATAC TGTATATGTG TATGTATATG 1270 

TAGAGTGTCT GGTTACCAGG TCAATTTTAT 1340 

CACGTTTATA CTACGTACTGT TATCGGTAT 1410 

AGTGAGTGAT ATTATTGTGGA GGTGACAGA 1480 

TGTCTAAAAC TCCCCTTAGA AGTCTCGTCA 1550 

AGTTATTTTT CCGGCAAATC CTTTCCCTGT 1620 

CTTGCGCCGG AAGGTCAGGT GGTACCCGTC 1690 

ATCGACTCAC AGGGCGCCCC GGGCTTCGCA 1760 

GCGGACCTAT GGCGTCGATC CTTATTACCT 1830 

CGGTACTAAT TCTCCCTGCC GGCCCCCGTA 1900 

TTCTGCCTGG TCTCGCTTTC GTAAACGGTT 1970 

TCTGCTAGTC TATGGCAGCA TCAAGGCTGG 2040 

ACTGGGCGGC CCGTCGAAGG CCCTTTGGTT 2110 

TGATTCTTGC CGGTACGTGG TGGTGGGTGC 2180 



Fig. 2. Nucleotide sequence and the de- 
duced amino acid sequence of 4-1BB. The 
nucleotides of the message strand are num- 
bered in the 5' and 3' direction, and numbers 
are shown on both sides of the sequence. 
Nucleotide residue 1 is the adenine of the 
initiation codon ATG, and the nucleotides on 
the 5' side of residue 1 are indicated by 
negative numbers. The predicted amino acid 
sequence is shown below the nucleotide 
sequence. Putative signal peptide and the 
potential asparagine-linked glycosylation 
sites are underlined; the potential transmem- 
brane domain is double underlined. The 
potential polyadenylylation signal is boxed. 
N, ambiguous nucleotide sequences in the 3' 
untranslated region; — , stop codon; •, cys- 
teine residues. 



residues 1-3. This reading frame codes for a polypeptide of 
256 amino acids with a M T of 27,587. The assigned ATG is 
preceded by an in-frame termination codon TGA (nucleotide 
residues -12 to -9). The sequence flanking the assigned 
ATG (nucleotide residues -5 to 4) contains eight of nine 
residues identical to the consensus sequence (CCRC- 
CATGG, where R represents guanosine or adenine) de- 
scribed by Kozak (22). The codon specifying the carboxyl- 
terminal leucine is followed by the translational termination 
codon TGA (nucleotide residues 769-771). Clone 4-1BB 
contains an unusually long 3 '-untranslated sequence, which 
did not extend as far as the poly(A) + tail. A potential 
polyadenylylation signal is present at nucleotides 1158-1163 
(Fig. 2, boxed). This signal may sometimes be functional 
because this gene produces at least two different sizes of 
mRNA. Fig. 3 a and b shows RNA blot analysis of Con 
A-stimulated L3 RNA. When the blot was hybridized to an 
L3G25#4 probe that contained sequences to the 3' side of the 
polyadenylylation signal (nucleotides 1284-1557), the probe 
detected one RNA species of «2.4 kilobases (kb). When the 



same blot was hybridized to an L3G14#2 probe that con- 
tained sequences to the 5' side of the polyadenylylation signal 
(nucleotides 661-855) the probe detected two mRNA species 
of » 1.5 kb and 2.4 kb. Only one copy of the sequence ATTTA 
that is often present in multiple copies in the 3 '-untranslated 
regions of unstable mRNA was present in 4-1BB, and further 
studies of the stability of its mRNA are needed. Northern blot 
analysis has shown that 4-1BB mRNA is inducible in both L2 
and L3 cells and is present in several CTL clones and 
hybridomas but is absent in the EL-4 T-cell line (B.S.K., 
unpublished results). 

The deduced sequence of the first 23 amino acids of 4-1BB 
cDNA has characteristics of the signal peptide of secretory 
and membrane-associated proteins (23) and fits the -1, -3 
rule (24). We putatively assigned the first 23 amino acids as 
a signal peptide, although the presence of lysine at -4 and 
glutamic acid at -5 are somewhat unusual. Thus, the protein 
backbone of processed 4-1BB protein would be composed of 
233 amino acids with a M T of 25,167. Two potential aspar- 
agine-linked glycosylation signals (25, 26) are located at 

KWON000044 



1966 Immunology: Kwon and Weissman 



Proc. Natl. Acad. ScL USA 86 (1989) 



< < 

CL C 
h- O 

<0 « « 

* -i <o co O 

^ m j j j 



< 

Q. 



< 
C 

o 
o 



(0 

^ -j co co O 

^ UJ J J J 



«J 



2BS 



S8S 



18S i 



II 



L3 3 




Fig. 3. Expression of two different sizes of 4-1BB mRNA. Ten micrograms of poly(A) + mRNA from mouse B-cell line (K46), phorbol 
12-myristate 13-acetate-stimulated EL-4 (EL-4 TP A), and rat natural killer cell line (LGL) and 10 fig of total RNA from unstimulated L3 (L3) 
and Con A-stimulated L3 (L3 ConA) were fractionated on a 1.4% formaldehyde-agarose gel, transferred to GeneScreeniVwj, and hybridized 
to 32 P-labeled L3G25#4 (a), L3G14#2 {b\ and L3G20#3 (c) sequentially. L3G25#4 and L3G14#2 represent cDNA fragments of the 3' side 
and 5' side to boxed AATAAA sequence, respectively. L3G20#3 is an anonymous cDNA from L3 cDNA library and is used to show that each 
lane of the blot contains a similar amount of RNA. Positions of 28S and 18S rRNA markers are each indicated; arrows indicate the specific 
hybridization signals. 



amino acid positions 128 and 138 as underlined in Fig. 2. The 
predicted 4-1BB protein contained an unusually large number 
of cysteines. There are 23 cysteine residues in the putative 
mature protein as marked with dots in Fig. 2. These residues 
are arranged with a spacing reminiscent of that seen in several 
groups of proteins including zinc finger DNA-binding pro- 
teins (27), epidermal growth factor receptor (28), the Dro- 
sophila Notch locus product (29), and certain translation 
factors (30, 31). Amino acids flanking certain of the cysteine 
groups also resemble sequences found in other proteins. For 
example, with a replacement of leucine by isoleucine, there 
is an exact match to seven of eight amino acids of a putative 
zinc finger structure in the yeast elF-20 protein (31). These 
residues may represent metal binding sites, but, as discussed 
by others (32), their role in the functions of a protein cannot 
be uniquely predicted. 

Following the cysteine-rich regions of the protein is a 
stretch of amino acids (residues 140-185) in which almost 
30% of the amino acids are serine or threonine; these are 
potential sites for O-linked glycosylation and are reminiscent 
of those seen, for example, in the low density lipoprotein 
receptor (33). There is a stretch of 26 amino acids that 
constitutes a hydrophobic domain toward the carboxyl ter- 
minus of the protein (amino acids 186-211) (Fig. 4), followed 
by a sequence containing several basic residues. This region 
may serve as a membrane-spanning anchor domain. 

Finally, the relatively short carboxyl-terminal putative 
cytoplasmic domain contains both two short runs of three and 
four acidic residues, respectively, and a sequence of five 
glycines followed by a tyrosine. The short stretches of acidic 
residues are also found in the cytoplasmic domain of the low 
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density lipoprotein and atrial natriuretic peptide receptors, 
for example (33, 35). 

Southern Blot Analysis. As shown in Fig. 5, a fragment of 
L2G25B and of 4-1BB cDN A each detects a single-restriction 
fragment of ~15 kb and 18 kb in C57BL/6 and BALB/c 
DNA, respectively. The data indicated that the genes encod- 
ing the two molecules exist as a single copy in C57BL/6 and 
BALB/c mice. 

DISCUSSION 

We and others have developed protocols for differential 
screening of cDN A libraries by which one can detect a broad 
representation of the mRNA differentially expressed in two 
cell types (1-4). The method was applied to the systematic 
analysis of HTL and CTL gene expression and offered an 
alternative approach to classical protein purification for 
identifying effector molecules and genes. The genes for the 
T-cell antigen receptor, certain T-cell-specific effectors, and 
the probable gene mutated in an X chromosome-linked 
immunodeficiency syndrome of mice (36) were cloned and 
characterized in this fashion, and the set of clones described 
in this and a previous report were obtained before most of the 
individual species represented had been isolated by other 
methods. 

A number of lymphokine clones, including interleukins 2, 
3, and 5 and granulocyte/macrophage colony-stimulating 
factor as well as other T-cell-specific clones such as the serine 
esterases and T-cell receptors, were selected in the relatively 
small libraries (total of 18,000 L2 clones and 10,000 L3 
clones) whose screening we have reported here and else- 
where. Probably the bulk of the most abundant T-cell-specific 
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Fig. 4. Hydropathicity profile of the predicted amino acid sequence of 4-1 BB. Local hydropathicity values calculated by the method of Kyte 
and Doolittle (34) were plotted versus amino acid residues. Positive values indicate hydrophobic regions, and negative values indicate hydrophilic 
regions. 
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Fig. 5. Southern blot analysis of mouse genomic DNA. Genomic 
DNA from C57BL/6 (lanes 1 and 3) and BALB/c (lanes 2 and 4) was 
digested with EcoRl restriction enzyme, fractionated on a 0.8% 
agarose gel, transferred to GeneScreen/Vwj, and hybridized to 
32 P-labeled L2G25B (lanes 1 and 2) and 4-1BB (lanes 3 and 4). 

sequences were detected. Nevertheless, other potent bio- 
logic effectors may well have been missed. For example, L3 
secretes large amounts of interferon -y, but, for unknown 
reasons, we did not recover a cDNA clone for this molecule. 
No clone for any member of the RANTES family other than 
L2G25B was detected, even though mRNA for such genes 
may be among the most abundant mRNA species produced 
in T cells. Similarly, clones for other T-cell activation 
antigens and T-cell-specific membrane channel-forming pro- 
teins were not detected. 

Clones of the cDNA of 4-1BB were obtained from both the 
L2 and L3 cells. The clones corresponding to L2G25B were 
all isolated from L2 cells, and the possibility remains that the 
mRNA sequences detected in L3 corresponded to a different 
member of this gene family. The two species of cDNA 
reported here represent the two abundant, inducible se- 
quences detected in both the HTL and CTL clones used in the 
present study. The induction properties of both clones 
resembled those of other lymphokines in that induction could 
be detected within 30 min after stimulation; they could both 
be induced either by Con A or by antigen stimulation. Unlike 
interleukin 2 receptor mRNA (37), induction of both mRNA 
species was blocked by treatment with cyclosporine A 
(B.S.K., unpublished work). Because 4-1BB protein was 
inducible in both T-cell clones, it may play roles in the T-cell 
activation process and function of the activated T cells less 
specific to a certain T-cell subset than those of the well- 
described lymphokines. As discussed above, 4-1BB has a 
number of properties reminiscent of membrane receptors and 
transmembrane signaling proteins. However, in at least the 
instances of the Drosophila Notch and sevenless loci (29, 38), 
an apparent transmembrane protein seems capable of acting 
directly or indirectly in trans on other cells. 
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